Ultravox V0 5 Llama 3 3 70b
MIT
Ultravox is a multimodal voice large language model built upon Llama3.3-70B and Whisper, supporting both voice and text inputs, suitable for scenarios like voice agents and translation.
Audio-to-Text
Transformers Supports Multiple Languages